16 Publications

Mark all

[16]
2024 | Conference Paper | LibreCat-ID: 56004 | OA
Neumann, Thilo von, Christoph Boeddeker, Tobias Cord-Landwehr, Marc Delcroix, and Reinhold Haeb-Umbach. “Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization.” In 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). IEEE, 2024. https://doi.org/10.1109/icasspw62465.2024.10625894.
LibreCat | Files available | DOI
 
[15]
2023 | Journal Article | LibreCat-ID: 35602 | OA
Neumann, Thilo von, Keisuke Kinoshita, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria.” IEEE/ACM Transactions on Audio, Speech, and Language Processing 31 (2023): 576–89. https://doi.org/10.1109/taslp.2022.3228629.
LibreCat | Files available | DOI
 
[14]
2023 | Conference Paper | LibreCat-ID: 48275 | OA
Neumann, Thilo von, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems.” In Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, 2023.
LibreCat | Files available | Download (ext.)
 
[13]
2023 | Conference Paper | LibreCat-ID: 54439 | OA
Boeddeker, Christoph, Tobias Cord-Landwehr, Thilo von Neumann, and Reinhold Haeb-Umbach. “Multi-Stage Diarization Refinement for the CHiME-7 DASR Scenario.” In 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). ISCA, 2023. https://doi.org/10.21437/chime.2023-10.
LibreCat | DOI | Download (ext.)
 
[12]
2023 | Conference Paper | LibreCat-ID: 48281 | OA
Neumann, Thilo von, Christoph Boeddeker, Keisuke Kinoshita, Marc Delcroix, and Reinhold Haeb-Umbach. “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems.” In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023. https://doi.org/10.1109/icassp49357.2023.10094784.
LibreCat | Files available | DOI | Download (ext.)
 
[11]
2022 | Conference Paper | LibreCat-ID: 33954 | OA
Boeddeker, Christoph, Tobias Cord-Landwehr, Thilo von Neumann, and Reinhold Haeb-Umbach. “An Initialization Scheme for Meeting Separation with Spatial Mixture Models.” In Interspeech 2022. ISCA, 2022. https://doi.org/10.21437/interspeech.2022-10929.
LibreCat | DOI | Download (ext.)
 
[10]
2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita, Keisuke, Thilo von Neumann, Marc Delcroix, Christoph Boeddeker, and Reinhold Haeb-Umbach. “Utterance-by-Utterance Overlap-Aware Neural Diarization with Graph-PIT.” In Proc. Interspeech 2022, 1486–90. ISCA, 2022. https://doi.org/10.21437/Interspeech.2022-11408.
LibreCat | DOI
 
[9]
2022 | Conference Paper | LibreCat-ID: 33819 | OA
Neumann, Thilo von, Keisuke Kinoshita, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data.” In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022. https://doi.org/10.1109/icassp43922.2022.9746757.
LibreCat | Files available | DOI
 
[8]
2022 | Conference Paper | LibreCat-ID: 33847 | OA
Cord-Landwehr, Tobias, Thilo von Neumann, Christoph Boeddeker, and Reinhold Haeb-Umbach. “MMS-MSG: A Multi-Purpose Multi-Speaker Mixture Signal Generator.” In 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.
LibreCat | Files available | arXiv
 
[7]
2022 | Conference Paper | LibreCat-ID: 33848 | OA
Cord-Landwehr, Tobias, Christoph Boeddeker, Thilo von Neumann, Catalin Zorila, Rama Doddipatla, and Reinhold Haeb-Umbach. “Monaural Source Separation: From Anechoic to Reverberant Environments.” In 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). Bamberg: IEEE, 2022.
LibreCat | Files available | arXiv
 
[6]
2022 | Misc | LibreCat-ID: 33816 | OA
Gburrek, Tobias, Christoph Boeddeker, Thilo von Neumann, Tobias Cord-Landwehr, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022. https://doi.org/10.48550/ARXIV.2205.00944.
LibreCat | Files available | DOI
 
[5]
2021 | Conference Paper | LibreCat-ID: 26770 | OA
Neumann, Thilo von, Keisuke Kinoshita, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers.” In Interspeech 2021, 2021. https://doi.org/10.21437/interspeech.2021-1177.
LibreCat | Files available | DOI
 
[4]
2021 | Conference Paper | LibreCat-ID: 29173 | OA
Neumann, Thilo von, Christoph Boeddeker, Keisuke Kinoshita, Marc Delcroix, and Reinhold Haeb-Umbach. “Speeding Up Permutation Invariant Training for Source Separation.” In Speech Communication; 14th ITG Conference, 2021.
LibreCat | Files available
 
[3]
2020 | Conference Paper | LibreCat-ID: 20762 | OA
Neumann, Thilo von, Keisuke Kinoshita, Lukas Drude, Christoph Boeddeker, Marc Delcroix, Tomohiro Nakatani, and Reinhold Haeb-Umbach. “End-to-End Training of Time Domain Audio Separation and Recognition.” In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 7004–8, 2020. https://doi.org/10.1109/ICASSP40776.2020.9053461.
LibreCat | Files available | DOI
 
[2]
2020 | Conference Paper | LibreCat-ID: 20764 | OA
Neumann, Thilo von, Christoph Boeddeker, Lukas Drude, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, and Reinhold Haeb-Umbach. “Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR.” In Proc. Interspeech 2020, 3097–3101, 2020. https://doi.org/10.21437/Interspeech.2020-2519.
LibreCat | Files available | DOI
 
[1]
2020 | Conference Paper | LibreCat-ID: 20766 | OA
Kinoshita, Keisuke, Thilo von Neumann, Marc Delcroix, Tomohiro Nakatani, and Reinhold Haeb-Umbach. “Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and Its Application to Speaker Stream Separation.” In Proc. Interspeech 2020, 2652–56, 2020. https://doi.org/10.21437/Interspeech.2020-2388.
LibreCat | Files available | DOI
 

Search

Filter Publications

Display / Sort

Export / Embed

16 Publications

Mark all

[16]
2024 | Conference Paper | LibreCat-ID: 56004 | OA
Neumann, Thilo von, Christoph Boeddeker, Tobias Cord-Landwehr, Marc Delcroix, and Reinhold Haeb-Umbach. “Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization.” In 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). IEEE, 2024. https://doi.org/10.1109/icasspw62465.2024.10625894.
LibreCat | Files available | DOI
 
[15]
2023 | Journal Article | LibreCat-ID: 35602 | OA
Neumann, Thilo von, Keisuke Kinoshita, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria.” IEEE/ACM Transactions on Audio, Speech, and Language Processing 31 (2023): 576–89. https://doi.org/10.1109/taslp.2022.3228629.
LibreCat | Files available | DOI
 
[14]
2023 | Conference Paper | LibreCat-ID: 48275 | OA
Neumann, Thilo von, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems.” In Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, 2023.
LibreCat | Files available | Download (ext.)
 
[13]
2023 | Conference Paper | LibreCat-ID: 54439 | OA
Boeddeker, Christoph, Tobias Cord-Landwehr, Thilo von Neumann, and Reinhold Haeb-Umbach. “Multi-Stage Diarization Refinement for the CHiME-7 DASR Scenario.” In 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). ISCA, 2023. https://doi.org/10.21437/chime.2023-10.
LibreCat | DOI | Download (ext.)
 
[12]
2023 | Conference Paper | LibreCat-ID: 48281 | OA
Neumann, Thilo von, Christoph Boeddeker, Keisuke Kinoshita, Marc Delcroix, and Reinhold Haeb-Umbach. “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems.” In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023. https://doi.org/10.1109/icassp49357.2023.10094784.
LibreCat | Files available | DOI | Download (ext.)
 
[11]
2022 | Conference Paper | LibreCat-ID: 33954 | OA
Boeddeker, Christoph, Tobias Cord-Landwehr, Thilo von Neumann, and Reinhold Haeb-Umbach. “An Initialization Scheme for Meeting Separation with Spatial Mixture Models.” In Interspeech 2022. ISCA, 2022. https://doi.org/10.21437/interspeech.2022-10929.
LibreCat | DOI | Download (ext.)
 
[10]
2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita, Keisuke, Thilo von Neumann, Marc Delcroix, Christoph Boeddeker, and Reinhold Haeb-Umbach. “Utterance-by-Utterance Overlap-Aware Neural Diarization with Graph-PIT.” In Proc. Interspeech 2022, 1486–90. ISCA, 2022. https://doi.org/10.21437/Interspeech.2022-11408.
LibreCat | DOI
 
[9]
2022 | Conference Paper | LibreCat-ID: 33819 | OA
Neumann, Thilo von, Keisuke Kinoshita, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data.” In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022. https://doi.org/10.1109/icassp43922.2022.9746757.
LibreCat | Files available | DOI
 
[8]
2022 | Conference Paper | LibreCat-ID: 33847 | OA
Cord-Landwehr, Tobias, Thilo von Neumann, Christoph Boeddeker, and Reinhold Haeb-Umbach. “MMS-MSG: A Multi-Purpose Multi-Speaker Mixture Signal Generator.” In 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.
LibreCat | Files available | arXiv
 
[7]
2022 | Conference Paper | LibreCat-ID: 33848 | OA
Cord-Landwehr, Tobias, Christoph Boeddeker, Thilo von Neumann, Catalin Zorila, Rama Doddipatla, and Reinhold Haeb-Umbach. “Monaural Source Separation: From Anechoic to Reverberant Environments.” In 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). Bamberg: IEEE, 2022.
LibreCat | Files available | arXiv
 
[6]
2022 | Misc | LibreCat-ID: 33816 | OA
Gburrek, Tobias, Christoph Boeddeker, Thilo von Neumann, Tobias Cord-Landwehr, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022. https://doi.org/10.48550/ARXIV.2205.00944.
LibreCat | Files available | DOI
 
[5]
2021 | Conference Paper | LibreCat-ID: 26770 | OA
Neumann, Thilo von, Keisuke Kinoshita, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers.” In Interspeech 2021, 2021. https://doi.org/10.21437/interspeech.2021-1177.
LibreCat | Files available | DOI
 
[4]
2021 | Conference Paper | LibreCat-ID: 29173 | OA
Neumann, Thilo von, Christoph Boeddeker, Keisuke Kinoshita, Marc Delcroix, and Reinhold Haeb-Umbach. “Speeding Up Permutation Invariant Training for Source Separation.” In Speech Communication; 14th ITG Conference, 2021.
LibreCat | Files available
 
[3]
2020 | Conference Paper | LibreCat-ID: 20762 | OA
Neumann, Thilo von, Keisuke Kinoshita, Lukas Drude, Christoph Boeddeker, Marc Delcroix, Tomohiro Nakatani, and Reinhold Haeb-Umbach. “End-to-End Training of Time Domain Audio Separation and Recognition.” In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 7004–8, 2020. https://doi.org/10.1109/ICASSP40776.2020.9053461.
LibreCat | Files available | DOI
 
[2]
2020 | Conference Paper | LibreCat-ID: 20764 | OA
Neumann, Thilo von, Christoph Boeddeker, Lukas Drude, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, and Reinhold Haeb-Umbach. “Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR.” In Proc. Interspeech 2020, 3097–3101, 2020. https://doi.org/10.21437/Interspeech.2020-2519.
LibreCat | Files available | DOI
 
[1]
2020 | Conference Paper | LibreCat-ID: 20766 | OA
Kinoshita, Keisuke, Thilo von Neumann, Marc Delcroix, Tomohiro Nakatani, and Reinhold Haeb-Umbach. “Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and Its Application to Speaker Stream Separation.” In Proc. Interspeech 2020, 2652–56, 2020. https://doi.org/10.21437/Interspeech.2020-2388.
LibreCat | Files available | DOI
 

Search

Filter Publications

Display / Sort

Export / Embed